Picture for Dongmei Jiang

Dongmei Jiang

EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization

Add code
May 28, 2026
Viaarxiv icon

JFAA: Technical Report for the EPIC-KITCHENS-100 Action Anticipation Challenge at EgoVis 2026

Add code
May 20, 2026
Viaarxiv icon

VISTA: Technical Report for the Ego4D Short-Term Object Interaction Anticipation at EgoVis 2026

Add code
May 20, 2026
Viaarxiv icon

Efficient Adversarial Training via Criticality-Aware Fine-Tuning

Add code
Apr 14, 2026
Viaarxiv icon

AlignMamba-2: Enhancing Multimodal Fusion and Sentiment Analysis with Modality-Aware Mamba

Add code
Mar 19, 2026
Viaarxiv icon

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Add code
Feb 22, 2026
Viaarxiv icon

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills

Add code
Jun 12, 2025
Figure 1 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 2 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 3 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 4 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Viaarxiv icon

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

Add code
Jun 12, 2025
Figure 1 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Figure 2 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Figure 3 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Figure 4 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Viaarxiv icon

Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection

Add code
May 28, 2025
Figure 1 for Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection
Figure 2 for Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection
Figure 3 for Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection
Figure 4 for Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection
Viaarxiv icon

Open-Det: An Efficient Learning Framework for Open-Ended Detection

Add code
May 27, 2025
Viaarxiv icon